Full text available: pdf (646.67 KB)

The rapid development of the Internet has resulted in more and more multimedia in Web 
content. However, due to limited bandwidth and the huge size of multimedia 
data, users always suffer from long waiting times. On the other hand, if we can predict the 
web object or page that the user most likely will view next while the user is viewing the 
current page, and pre-fetch the content, then the perceived network latency can be 
significantly reduced. In this paper, we present an n-gram bas ... 

The World Wide Web can be considered as a large distributed information system that 
provides access to shared data objects. As one of the most popular applications currently 
running on the Internet, the World Wide Web is growing exponentially in size, which 
results in network congestion and server overloading. Web caching has been recognized as 
one of the effective schemes to alleviate the service bottleneck and reduce network 
traffic, thereby minimizing user access latency. In this pap ... 

With the exponential growth of hosts and traffic workloads on the Internet, collaborative 
web caching has been recognized as an efficient solution to alleviate web page server 
bottlenecks and reduce traffic. However, cache discovery, i.e., locating where a page is 
cached, is a challenging problem, especially in the fast growing World Wide Web 
environment, where the number of participating proxies can be very large. In this paper, 
we propose a new scheme which employs proxy affinities to mai ... 

In this paper, we study the dynamics of the MSNBC news site, one of the busiest Web sites 
on the Internet today. Unlike many other efforts that have analyzed client accesses as seen 
by proxies, we focus on the server end. We analyze the dynamics of both the server 
content and client accesses made to the server. The former considers the content creation 
and modification process while the latter considers page popularity and locality in client 
accesses. Some of our key results are: (a) files ... 

January 2000 ACM SIGKDD Explorations Newsletter, volume 1 issue 2 

Web usage mining is the application of data mining techniques to discover usage patterns 
from Web data, in order to understand and better serve the needs of Web-based 
applications. Web usage mining consists of three phases, namely preprocessing, pattern 
discovery, and pattern analysis. This paper describes each of these phases in detail. Given 
its application potential, Web usage mining has seen a rapid increase in interest, from both 
the research and practice communities. This pap ... 

Keywords: data mining, web usage mining, world wide web 

A WebView is a web page automatically created from base data typically stored in a DBMS. 
Given the multi-tiered architecture behind database-backed web servers, we have the 
option of materializing a WebView inside the DBMS, at the web server, or not at all, always 
computing it on the fly (virtual). Since WebViews must be up to date, materialized 
WebViews are immediately refreshed with every update on the base data. In this paper we 
compare the three materialization policies (materializ ... 

Recent advances in wireless data networking and portable information appliances have 
engendered a new paradigm of computing, called mobile computing, in which users carrying 
portable devices have access to data and information services regardless of their physical 
location or movement behavior. In the meantime, research addressing information access 
in mobile environments has proliferated. In this survey, we provide a concrete framework 
and categorization of the various way ... 

Keywords: application adaptation, cache invalidation, caching, client/server, data 
dissemination, disconnected operation, mobile applications, mobile client/server, mobile 
computing, mobile data, mobility awareness, survey, system application 

We developed a user interface that organizes Web search results into hierarchical 
categories. Text classification algorithms were used to automatically classify arbitrary 
search results into an existing category structure on-the-fly. A user study compared our 
new category interface with the typical ranked list interface of search results. The study 
showed that the category interface is superior both in objective and subjective measures. 
Subjects liked the category interface much better than t ... 

Keywords: World Wide Web, classification, search, support vector machine, text 

Providing the infrastructure that supports the World-Wide Web is expensive. The costs 
incurred in running a web site include those associated with the content being served; 
those associated with the hardware that supports the site; and the network costs incurred 
in transmitting that content to the end consumers. In this work we examine mechanisms 
for compressing web content so as to reduce the third of these three costs, and describe a 
scheme that exploits the known connectivities between web pag ... 

17 Supporting quality of service in HTTP servers 
Raju Pandey, J. Fritz Barnes, Ronald Olsson 

June 1998 Proceedings of the seventeenth annual ACM symposium on Principles of 

In a wireless communications network, the movement of mobile users presents significant 
technical challenges to providing efficient access to the wired broadband network. In this 
paper, we construct a new analytical/numerical model that characterizes mobile user 
behavior and the resultant traffic patterns. The model is based on a semi-Markov process 
representation of mobile user behavior in a general state-space. Using a new algorithm for 
parameter estimation of a general Hidden Semi-Markov ... 

The characteristics of the data traffic generated by the use of micro-browser-enabled PCS 
phones to gain access to the Web are of particular interest to cellular network operators. 
Questions such as the frequency and length of browser sessions, and the specific 
characteristics of the traffic generated, need to be answered by researchers. These answers 
are valuable in network capacity planning as more subscribers use their cellular phones to 
interact with the Web. 

We developed and evaluated seven interfaces for integrating semantic category information 
with Web search results. List interfaces were based on the familiar ranked-listing of search 
results, sometimes augmented with a category name for each result. Category interfaces 
also showed page titles and/or category names, but re-organized the search results so that 
items in the same category were grouped together visually. Our user studies show that all 
Category interfaces were more effective than ... 

Keywords: World Wide Web, focus-in-context, search, text categorization, usability, user 
interface, user study 